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SHAPE DESCRIPTOR EXTRACTING METHOD 

BACKGROUND OF THE INVENTION 

1 . Field of the Invention 

The present invention relates to a shape descriptor extracting 
method, and more particularly, to a shape descriptor extracting method 
based on an image skeleton. The present invention is based on 
Korean Patent Application No. 2000-62163 which is incorporated 
herein by reference. 

2. Description of the Related Art 

A shape descriptor is based on a lower abstraction level 
description enabling an automatic extraction, and is a basic descriptor 
which humans can perceive from an image. Algorithms, which 
describe the shape of a specific object within an image and measure 
the degree of matching or similarity based on the shape, are studied. 
However, the algorithms only describe the shapes of the specific 
objects, so that there are many problems in perceiving the shapes of 
general objects. Currently, shape descriptors, suggested by a 
standard group, such as MPEG-7, are obtained by looking for features 
through various transformations of the given objects to solve the above 



problem- 
There are many kinds of shape descriptors. Two shape 
descriptors adopted in experimental Model 1 (XM) of MPEG-7 are 
known as a Zernike moment shape descriptor and a curvature scale 
space shape descriptor. 

As for the Zernike moment shape descriptor, Zernike basis 
functions are defined for a variety of shapes to investigate the shape of 
an object within an image. Then, the image of fixed size is projected 
over the basis functions, and the resultant values are used as the 
shape descriptors. 

As for the curvature scale space descriptor, the contour of a 
model image is extracted, and changes of curvature points along the 
contour are expressed on a scaled space. Then, the locations with 
respect to the peak values are expressed as a z-dimensional vector. 
However, to extract the former descriptor, the sizes of input images are 
restricted. Meanwhile, to extract the latter shape descriptor, the 
extracted shape must be only one object. 

SUMMARY OF THE INVENTION 
To solve the above problems, it is an objective of the present 
invention to provide a shape descriptor extracting method which can be 
effectively applied to a motion video compression technique and an 



image searching technique based on the motion video compression 
technique. 

it is another objective of the present invention to provide an 
image searching method which searches an image similar to query 
images within images indexed, using shape descriptors extracted by 
the shape descriptor extracting method. 

It is another objective of the present invention to provide a 
dissimilarity measuring method which measures dissimilarity between 
images to be indexed, using shape descriptors extracted by the shape 
descriptor extracting method. 

Accordingly, to achieve the above objectives, there is provided a 
shape descriptor extracting method according to one aspect of the 
present invention including: (a) determining a shape descriptor based 
on an extracted skeleton by extracting a skeleton of images. 

Also, to achieve the above objectives, there is provide a shape 
descriptor extracting method according to another aspect of the 
present invention including: (a) extracting a skeleton from input images; 
(b) obtaining a list of straight lines by performing a connection of pixels 
based on the extracted skeleton; and (c) determining a regular list of 
straight lines obtained by normalizing the list of straightjines as a 
shape descriptor. 

Also, the step (a) preferably includes: (a-1) obtaining a distance 
map by performing a distance transform on input images; and (a-2) 



extracting a skeleton from the obtained distance map. 

Also, the step (b) preferably includes: (b-1) thinning the 
extracted skeleton; and (b-2) extracting straight lines by connecting 
each pixel within the thinned skeleton. 

Also, the step (c) preferably includes: (c-1) drawing out a list of 
connected beginning and end points; (c-2) obtaining a first list of 
straight lines by straight-combining extracted straight lines; and (c-3) 
determining a second list of straight lines obtained by normalizing the 
first list of straight lines based on a maximum distance between ending 
points of each straight line. 

Also, the distance transform is preferably based on a function 
showing each point of the inside of an object as a value of a minimum 
distance from a background. 

Also, the step (a-2) preferably includes: obtaining a local 
maximum from the distance map using an edge detecting method. 

Also, the step (a-2) preferably includes: (a-2-1) performing a 
convolution using a local maximum detecting mask of four directions to 
obtain a local maximum. 

Also, after the step (a-2-1), it is preferable to further include: (a- 
2-2) recording a label corresponding to a direction having the -greatest 
size in a direction map and a magnitude map. 

Also, it is preferable that the input images are binary images. 

Also, it is preferable that the step (b-1) further includes: leaving 



the biggest pixel in the direction rotated by 90-degrees front the 
corresponding direction and removing the rest of the pixels. 

Also, it is preferable that the step (c-2) further includes: drawing 
out a list of beginning and end points of each line segment by 
connecting pixels having the same label in the direction map, using a 
direction map having four directions. 

Also, it is preferable that the step (c-2) further includes: 
performing a straight line combination by changing a threshold value of 
an angle between each straight line, a distance, and a length of a 
straight line from the obtained first list of straight lines. 

Also, it is preferable that the straight line combination is 
repeated until the number of remaining straight lines becomes equal to 
or less than a predetermined number. 

Also, to achieve the above objectives, there is provided an 
image searching method according to the present invention which 
includes: (a) obtaining a list of straight lines from a shape descriptor of 
a query image; (b) obtaining dissimilarity by comparing a list of straight 
lines of a shape descriptor of a detected image with a list of straight 
lines of a shape descriptor of a query image. 

Also, to achieve the above objectives, there is provided a 
dissimilarity measuring method, wherein a method for measuring 
dissimilarity between images indexed using a shape descriptor formed 
on the basis of a skeleton includes: (a) obtaining a list of straight lines 



from a shape descriptor of a query image; and (b) comparing a (fst of 
straight lines of a shape descriptor of a detected image with that of the 
shape descriptor of the query image, and obtaining dissimilarity. 

BRIEF DESCRIPTION OF THE DRAWINGS 
The above objectives and advantages of the present invention 

will become more apparent by describing in detail a preferred 

embodiment thereof with reference to the attached drawings in which: 
FIG. 1 is a flowchart illustrating main steps of extracting a shape 

descriptor according to a preferred embodiment of the present 

invention; 

FIGS. 2A through 2D are drawings illustrating examples of 
masks for detecting a local maximum; 

FIG. 3A is a drawing illustrating an example of a binary image; 

FIG. 3B is a drawing illustrating a distance map scaled from a 
black-and-white image; 

FIG. 3C is a drawing illustrating a skeleton image; 

FIG. 3D is a drawing illustrating a thinned skeleton image; 

FIG. 3E is a drawing illustrating the result of a straight line 
approximation; 

FIG. 4 is a flowchart illustrating the main steps of an image 
searching method based on a shape descriptor according to a 
preferred embodiment of the present invention ; and 



FIGS. 5 and 6 are drawings illustrating the results of* trial 
experiments on binary images which are used as experimental images 
for an experimental model (XM) version of MPEG-7 standard in order 
to evaluate the performance of an image searching method according 
to the present invention. 

DETAILED DESCRIPTION OF THE INVENTION 
Hereinafter, preferred embodiments of the present invention will 
be described in greater detail with reference to the appended drawings. 

According to the present invention, a shape descriptor using a 
skeleton is defined. The shape descriptor based on the skeleton is 
obtained by extracting a line, which is a basis of perception for 
humans, from a given shape, and by simplifying the extracted line. 
Particularly, according to the shape descriptor extracting method, the 
shape descriptor can be simplified by extracting a skeleton rather than 
an edge. 

FIG. 1 is a flowchart illustrating the main steps of the shape 
descriptor extracting method according to a preferred embodiment of 
the present invention. Referring to FIG. 1, in the shape descriptor 
extracting method according to a preferred embodiment of the present 
invention, first, an image is input (step 102), and a distance transform 
is performed on the input image to obtain a distance map (step 104). 
The distance transform used to obtain the distance map uses a 



function which indicates respective points within an objective as the 
shortest distance value from the background. Next, a skeleton is 
extracted from the distance map (step 106). It is well-known that a 
local maximum in the distance map is a point of a skeleton. The 
distance transform used to obtain the distance map is based on a 
function which indicates respective points within an objective as the 
shortest distance value from the background. In a preferred 
embodiment, the local maximum in the distance map is determined as 
a skeleton by the distance transform. To obtain the local maximum 
from the distance map, in a preferred embodiment, it is possible to use 
an edge detecting method which is used in "Linear Feature Extraction 
and Description" (R. Nevatia and K. R. Babu, Computer Graphics and 
Image Processing, Vol. 13, pp. 257-269, 1980), incorporated herein by 
reference. FIGS. 2A through 2D illustrate examples of a mask for 
detecting the local maximum. Referring to FIGS. 2A through 2D, 
masks for detecting the local maximum of four-directions are used for 
detecting the local maximum. FIG. 2A is a mask corresponding to the 
direction of 0 degrees. FIG. 2B is a mask corresponding to the 
direction of 45 degrees. FIG. 2C is a mask corresponding to the 
direction of 90 degrees. FIG. 2D is a mask corresponding to the 
direction of 135 degrees. Then, a convolution is performed using the 
masks. As a result, a label corresponding to the direction having the 
greatest size is recorded on a direction map and a magnitude map. 



Hereby, the local maximum is obtained on the distance map obtained 
by the distance transform from the binary image illustrated in FIG. 3A, 
so that the skeleton is extracted. 

Next, the extracted skeleton is thinned (step 108). The thinning 
can be performed by, for example, leaving a pixel having the greatest 
size in the direction rotated by 90-degrees from the corresponding 
direction on the direction map and removing the rest of the pixels. FIG. 
3D illustrates an example of a thinned skeleton image. 

Next, straight lines are extracted by connecting respective pixels 
within the thinned skeleton (step 110). That is, the respective pixels 
within the thinned skeleton are connected along one direction, and 
straight lines are extracted by making a list of starting and end points of 
the line. In a preferred embodiment, the direction maps of four 
directions illustrated in FIGS. 2A through 2D are used, and pixels 
having the same level on the direction map are connected to make a 
list of starting and end points of respective line segments. 

Next, a list of straight lines is obtained by straight line 
combination of the extracted straight lines (step 112). That is, 
changing threshold values of angle, distance, and length between 
respective straight lines from the obtained list of straight lines, the 
straight line combination is performed. The straight line combination is 
repeated until the number of remaining straight lines becomes equal to 
or less than the predetermined number. FIG. 3E illustrates the result of 



the straight line approximation. Then, a list of straight lines obtained by 
normalizing a list of straight lines based on a maximum distance 
between the ending points of respective straight lines is determined as 
a shape descriptor (step 114). That is, according to the shape 
descriptor extracting method, the skeleton of the binary image is 
extracted, and the extracted skeleton is used as the shape descriptor. 

According to the shape descriptor extracting method, the 
skeleton of the binary image is extracted as the shape descriptor, and 
the extracted shape descriptor can be used for the combination of 
images. Also, in the shape descriptor extracting method, the skeleton 
is extracted from the binary image, and the extracted skeleton is 
approximated as a straight line. Also, to effectively extract straight 
lines, the binary image is distance-transformed, and the local maximum 
is obtained to extract the skeleton. The extracted skeleton is 
approximated as a certain number of straight lines using the edge 
extracting method. The number of approximated straight lines is 
limited to a certain number, so that it is possible to perform a further 
faster matching. 

Hereinafter, a method for searching for images similar to query 
images from a database which stores images indexed by the shape 
descriptor extracting method will be described. Also, an effect of the 
shape descriptor extracting method will be described by evaluating the 
performance of searching for images similar to query images within the 



- * image database including images indexed using the shape descriptor 
extracted by the shape descriptor extracting method described with 
reference to FIG. 1. 

FIG. 4 is a flowchart illustrating the main steps of the image 

> searching method according to the present invention. First, a list of 
straight lines is obtained from the shape descriptor of the query image 
(step 402). Next, dissimilarity is obtained by comparing the list of 
straight lines of the shape descriptor of the detected image with that of 
the shape descriptor of the query image (step 404). 

o In the preferred embodiment, the distances between the ending 

points of the straight lines forming the skeleton are measured, and the 
sum of the minimum values of the measured distances is determined 
as a dissimilarity value. In a dissimilarity specific function, when N, D 1k , 
and D 2 k are respectively, 
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Here, Q denotes a straight line to be detected, M denotes a 
detected straight line, S denotes a starting point of each straight line, E 
is an ending point of each straight line, N Q is the total number of 
straight lines which the shape descriptor of the query image has, N M is 
the total umber of straight lines which the shape descriptor of the 
detected image has. 

Referring to formula 4, the sum of the minimum value of the 
distances between straight lines measured by formulas 2 and 3 is 
determined as dissimilarity of two descriptors. That is, the smaller the 
result value of formula 4 is, the more similar two objects are regarded 
as being. Also, it is possible to obtain a value which does not change 
with respect to rotation by performing the measurement at a regular 
interval of a rotating angle. 



Now, images having shape characteristics similar to the query 
image are searched for on the basis of dissimilarity obtained in the step 
404. The image having the least dissimilarity with respect to the query 
image among the searched images, is determined as a final searched 
image. The searching method based on dissimilarity is called a 
matching method, and the final searched image is called a matched 
image. 

To evaluate the performance of the method, a trial experiment is 
performed on the binary images used as experimental images of an 
experimental model (XM) version of MPEG-7 standard. Various 
threshold values for the straight line combination are experientially 
decided. The straight line combination is only performed at an angle of 
30 degrees, and the distance between ending points of the two straight 
lines, which are straight line combined, is decided as 5% of the smaller 
value among the width and length of the real image, and the length of 
the straight line is neglected after the straight line combination is 
decided as 1% of the greater value among the width and length. Also, 
the threshold value increases by 10% at every repeated performance, 
and the number of the straight lines becomes equal to or less than 10. 

The result of the experiment is illustrated in FIGS. 5 and 6. 
Referring to FIG. 5, the image searching method according to the 
present invention does not show good searching performance when 
searching for images having a similar shape to the query image from 
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the images which are not classified at all. This is because information 
of the detailed portion is lost during the approximation process for 
making the straight lines. Also, referring to FIG. 6, the image searching 
method shows very good searching performance when searching for 
the classified images, that is images having similar shape to the query 
image, from the data collection of the same category. Therefore, the 
shape descriptor extracting method is advantageous for extracting local 
motion in the data of the same category. The reason why the method 
is advantageous for extracting local motion of the same object is that 
the shape descriptor extracted by the shape descriptor extracting 
method of the present invention possesses information about 
schematic features of the shape included in the image. 

In the above preferred embodiments, a method for searching for 
images, having a similar shape to the query image with respect to the 
images indexed by the shape descriptor extracting method described 
with reference to FIG. 1, is described. However, in the image 
searching method, a step of measuring dissimilarity between the query 
image and the searched image can also be applied to grouping images 
having similar shapes on the basis of the measured dissimilarity. 

The shape descriptor extracting method can be applied to a 
moving image compression technique on the basis of standards such 
as objective-based compression techniques, MPEG-4, MPEG-7, and 
MPEG-21. Also, it can be effectively applied to the image searching 
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technique based on the motion video compression technique. 

Also, the shape descriptor extracting method and image 
searching method according to the present invention can be written as 
a program executed on a personal or server computer. Program codes 
and code segments constructing the program can be easily inferred by 
computer programmers skilled in the art. Also, the program can be 
stored in computer-readable recording media. The recording media 
may be magnetic recording media, optical recording media, or radio 
media. 

Since the shape descriptor extracted by the shape descriptor 
extracting method according to the present invention possesses 
information about schematic features of the shape included in the 
image, local motion can be effectively extracted in the data collection of 
the same category. Also, the image searching method, which 
searches for images having similar shapes to the query image within 
the image data base indexed by the shape descriptor extracting 
method, has very good searching performance when searching for 
images having simitar shapes to the query image from the classified 
images. 



